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ABSTRACT 

Detailed characterization of exoplanets has begun to yield measurements of their atmospheric prop¬ 
erties that constrain the planets’ origins and evolution. For example, past observations of the dayside 
emission spectrum of the hot Jupiter WASP-I2b indicated that its atmosphere has a high carbon-to- 
oxygen ratio (C/0 > 1), suggesting it had a different formation pathway than is commonly assumed 
for giant planets. Here we report a precise near-infrared transmission spectrum for WASP-I2b based 
on six transit observations with the Hubble Space Telescope/'Wide Field Camera 3. We bin the data 
in 13 spectrophotometric light curves from 0.84 - 1.67 /rm and measure the transit depths to a median 
precision of 51 ppm. We retrieve the atmospheric properties using the transmission spectrum and find 
strong evidence for water absorption (7 a confidence). This detection marks the first high-confidence, 
spectroscopic identification of a molecule in the atmosphere of WASP-12b. The retrieved 1 a water 
volume mixing ratio is between 10“^ — 10“^, which is consistent with C/0 > 1 to within 2cr. How¬ 
ever, we also introduce a new retrieval parameterization that fits for C/0 and metallicity under the 
assumption of chemical equilibrium. With this approach, we constrain C/0 to at 1 cr and rule 

out a carbon-rich atmosphere composition (C/0> 1) at > 3cr confidence. Further observations and 
modeling of the planet’s global thermal structure and dynamics would aid in resolving the tension be¬ 
tween our inferred C/0 and previous constraints. Our findings highlight the importance of obtaining 
high-precision data with multiple observing techniques in order to obtain robust constraints on the 
chemistry and physics of exoplanet atmospheres. 

Subject headings: planets and satellites: atmospheres — planets and satellites: composition — planets 
and satellites: individual: WASP-I2b 


1. INTRODUCTION 

The chemical composition of a planetary atmosphere 
provides a rich record of the planet’s formation condi¬ 
tions and evolutionary history. Measurements of the 
composition can constrain the planet’s formation mech¬ 
anism, its formation location in the protoplanetary disk, 
the surface density and composition of planetesimals at 
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spheric chemical composition, a large sample size of plan¬ 
ets is required to develop a comprehensive theory of giant 
planet formation. Fortunately, the sample of known ex¬ 
trasolar giant planets is large and growing, and recent 
observations of these planets have begun to yield ba¬ 
sic constraints on their atmospheric chemistry. These 
include inferences of the carbon-to-oxygen ratio (C/0) 
and absolute water abundance, which in some cases rival 
our knowle dge of those quantities for the Solar System 
plane ts (e.g. Madhusudhan et al.|2011b [Konopacky et ah ' 


2013 Line et af 2014||Kreidberg et al.|2014a[|Hrogi et al. 

2014 Madhusuc han et al.||2014D). 


Une of the best studied exoplanet atmospheres is that 
of the transiting hot Jupiter WASP-12b. This planet is 
a 1.4Mjup, 1.8i?jup gas giant o rbiting a late-F ho st star 


with a period of just 1.1 days (Hebb et al. 20091. The 
brightness of the host star (H=10.2), the planet^s high 
equilibrium temperature (2500 K), and the planet’s large 
size make WASP-12b a favorable target for atmosphere 
characterization. The system has been observed exten¬ 
sively from the ground and space to measure the planet’s 
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An intriguing possibility has emerged from these ob¬ 
servations of WASP-12b, which is that the planet has 
a carbon-rich atmospheric compo sition (C/0 > 1, com - 
pared to the solar value of 0.55; Asplund et al. 2009 ). 
The high C/0 interpreta tion was first suggested by|Mad- 
husudhan et al. (2011b) as the best explanation for the 
planet’s dayside emission sp ectrum. This result was con¬ 
tested by su bsequent work (Crossfield et al. 2012 Line 
et al. |2014); however, the most recent comprenensive 
analysis of th e dayside spectrum reaff irmed the inference 
of high C/0 dStevenson et al.||2014b|). 

The claim of high C/U m WAIs]r’-12b’s atmosphere 
has motivated substantial theoretical and observational 
work. This includes attempts to measure C/0 for addi - 
tional planets (e.g. Brogi et al. 2014 Line et al. 2014), 
studies of the effect of C/O on atmospheric chemistry 
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atmosphere on WASP-12b is not definitive. The high 
C/O inference is based primarily on the photometric 
secondary eclipse depth from Spitzer at 4.5 /im. The 
low brightness temperature for the planet in this band¬ 
pass is best explained by absorption from carbon-bearing 


species (either CO, HCN, or C 2 H 2 ; Madhusudhan et al. 
2011bl|Kopparapu et al.|2012[ [Moses et al.|2U13[|!Steven- 

son et al.|2014cp, but photometry alone cannot uniquely 

identity which molecule is the dominant absorber. No 
other molecular features have been confidently identi¬ 
fied in the emission spectrum, and measurements of the 
planet’s transmission spectrum h ave yielded even less 
conclusive constraints on the C/O (Sing et al.|2013| Man- 


dell et al. 120131 ISwain et aL|2013[[S'tevenson et af.|2U14c[ 


Madhusudhan et al.|2014b ).' High-precision spectroscopy 

is thus needed to obtain unambiguous determination of 
the atmospheric composition of this important planet. 

In this work, we report a new, precise measurement of 
WASP-12b’s transmission spectrum over the wavelength 
range 0.84 to 1.67 pja. The outline of the paper is as 
follows. In §[^we present the observations and data re¬ 
duction. The light curves fits and measurement of the 
transmission spectrum are outlined in §1^ We compare 
our results to previous measurements in ^ We describe 
a retrieval of the planet’s atmospheric properties based 
on this transmission spectrum in §[^ and discuss impli¬ 
cations for the chemical composition in §1^ We conclude 

in §0 


2. OBSERVATIONS AND DATA REDUCTION 
2.1. Observations 

We obtained time series spectroscopy during six tran¬ 
sits of WASP-12b between UT 1 January and 4 March 
2014 using the Wide Field Camera 3 (WFC3) IR detec¬ 
tor on the Hubble Space Telescope {HST) as part of HST 
GO Program 13467. Three of the transit observations 
used the G102 grism, which provides spectral coverage 
from 0.82 — 1.12 /rm, and the other three used the G141 
grism, which spans the range 1.12 — 1.65 p,m. The G141 
grism has been widely used for exoplanet transit spec¬ 
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Fig. 1.— Raw HST/WFCS images taken with the G102 grism. 
Spatial scan and staring mode data are shown in the top and bot¬ 
tom panels, respectively. The images are cutouts from a 256x256 
pixel subarray. The spectrum of WASP-12A’s binary companion 
is visible in the bottom panel near row 10. 


troscopy, but these observations are the first to use the 
G102 grism for this purpose. The additional wavelength 
coverage from the G102 grism provides access to features 
from more molecules and absorption bands, giving us 
greater leverage in constraining the atmospheric compo¬ 
sition of the planet. Each transit observation (called a 
visit) consisted of five consecutive, 96-minute HST or¬ 
bits. WASP-12 was visible during approximately 45 min¬ 
utes per orbit and occulted by the Earth for the remain¬ 
der of the time. We took a direct image of the target 
with the F126N narrow-band filter at the beginning of 
each orbit for wavelength calibration. Example staring 
mode and spatial scan images are shown in Figure 

The spectroscopic data were obtained in spatial scan 
mode with the 256x256 subarray, using the SPARSIO, 
NSAMP=16 readout pattern, which has an exposure 
time of 103.1s. The scan rates were 0.04 and 0.05 arc- 
sec/second for the G102 and G141 grisms, respectively. 
The spectra extend roughly 40 pixels in the spatial direc¬ 
tion, with peak per-pixel counts below 25,000 electrons 
for both grisms. We alternated between forward and 
reverse scanning along the detector to decrease instru¬ 
mental overhead time. This setup yielded 19 exposures 
per orbit and a duty cycle of 74%. 

In addition to the spatial scan data, we also obtained 
10 staring mode spectra in each grism during the first 
orbit of the first visit. WASP-12 is a triple star system: 
WASP-12 A hosts the planet, and WASP-12 BG is an M- 
dwarf binary separated from WASP-12 A by about 1” 
(|Bergfors et al.||2013||Bechter et al.||2014|). We use these 
staring mode data to resolve the spectruni of WASP-12 A 
from WASP-12 BC, enabling us to correct for dilution to 
the planet’s transit light curve due to the binary. The 
detector orientation was set to 178.7° in order to spatially 
separate the spectra. At this orientation, the spectra are 
separated by 7.9 pixels (compared to the FWHM of 1.1 
pixel at 1.4 fim). We describe the dilution correction in 
detail in § |3.3| 

We also obtained photometric monitoring of the 
WASP-12 system to search for stellar activity. We ac¬ 
quired 428 out-of-transit R-band images over the 2011- 
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TABLE 1 

Summary of Photometric Observations for WASP-12 


Season 

^obs 

Date Range 
(HJD - 2,400,000) 

Sigma 

(mag) 

Seasonal Mean 
(mag) 

2011-2012 

126 

55824-56059 

0.00325 

-0.28048 ± 0.00029 

2012-2013 

169 

56189-56439 

0.00297 

-0.28119 ±0.00023 

2013-2014 

133 

56463-56799 

0.00316 

-0.28253 ±0.00027 



Fig. 2.— Nightly Cousins il-band photometric observations of 
WASP-12 over three observing seasons (points). The data are dif¬ 
ferential magnitudes relative to the average brightness of 17 com¬ 
parison stars. The differential magnitudes have a mean of -0.281 
and a standard deviation of 0.003. The horizontal dashed line in¬ 
dicates the mean brightness from the first observing season. The 
vertical dotted lines span the time range of the HST observations. 


2012, 2012-2013, and 2013-2014 observing seasons with 
Tennessee State University’s Celestron 14-inch auto¬ 
mated imaging telescope. We tested for variability 
from star spots by calculating differential magnitudes for 
WASP-12 relative to 17 comparison stars (shown in Fig¬ 
ure]^ . The standard deviation of the differential magni¬ 
tudes is 0.3%, which is comparable to the photon noise 
for the data. The brightness of the star increases by ap¬ 
proximately 0.001 mag per year. There are no significant 
periodicities between 1 and 200 days. To calculate the 
impact of star spots on our transmission spectrum, we 
took 0.3% as an upper limit for the variability of WASP- 
12. Variability of this amplitude could be produced by 
star spots 300 K cooler than the star’s effective temper¬ 


ature (6300 K; Hebb et al. 
photosphere . Ba sed on the 


2009), covering 3% of the 


ormalism outlined in Berta 


et al. (2011) andlDesert et al. (2011), we calculated that 


the niaximum variation in transit depth due to spots with 
these properties is of order 10“®, which is below the pre¬ 
cision of our transit depth measurements. 


2.2. Data Reduction 

We reduced the H ST/WFC3 data using th e custom 
pipeline described in Kreidberg et al. (2(H4b). Exam¬ 
ple extracted spectra are shown in h igure[3pWe bin the 
spectra into 24- and 15-pixel wide channels (for G102 and 
G141, respectively) to obtain a total of 13 spectrophoto- 
metric light curves at resolution R = A/AA = 15 — 25. 
We also sum the spectra over the full wavelength range to 
create broadband (“white”) light curves for each grism. 
The uncertainty in the flux per exposure is determined 



Fig. 3.— Example extracted spectra (solid lines). The wave¬ 
length ranges covered by the transmission spectra are indicated 
with dashed and dotted lines (for the G102 and G141 data, respec¬ 
tively). The uncertainties on the spectra are smaller than the plot 
linewidth. 


by adding in quadrature the photon noise, the read noise 
(22 electrons per differenced image), and the error in the 
estimate of the background, which we determine from 
the median absolute deviation of the flux values for the 
background pixels. 

We also extract spectra from the staring mode data to 
determine the wavelength-dependent flux ratio of WASP- 
12 BG to WASP-12 A. The spectra are spatially sepa¬ 
rated on the detector, so to extract them we fit a dou¬ 
ble Gaussian model to each column of the final detector 
read. We obtain final estimates and uncertainties for the 
spectra of WASP-12 A and WASP-12 BC by taking the 
mean and standard deviation of the spectra from all 10 
exposures. These extracted spectra are each assigned a 
wavelength solution using the centroids of the stars in 
the direct image. 


3. LIGHT CURVE FITS 
3.1. Broadband Light Curves 

We fit the broadban d light curves with a transit model 
(Mandel & Agol|2002 ) and an analytic function to correct 
for instrument systematics. Transit light curve obser¬ 
vations with HST /WFG3 have several well-documented 
systematic trends in flux with time, includ ing visit-long 
slopes and orbit-long exponential ramps (Berta et al. 


20121 ISwain et al.]|2013| [Kreidberg et al.||2U14b| [bteven- 


son et al. 2014a|). The first orbit of a visit has a larger 

amplitude ramp than subsequent orbits, so we follow es¬ 
tablished practice and do not use this initial orbit in the 
light curve fits. We also discard the first exposure from 
the remaining orbits, which improves the fit quality. We 
fit the remaining data (216 exposures for each grism) 
with a systematic s model based on t he model-rEunp pa¬ 
rameterization of Berta et al.| (2012). The model M{t) 
has the form: 
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M{t) =T{t) X {cS{t) + V tv) 

X (1 - exp(-atorb - £>(t)))- (1) 

In our fits, T{t) is the transit model, or relative stel¬ 
lar flux as a function of time t (in BJDtdb)- The free 
parameters for the transit model are the planet-to-star 
radius ratio fc, a linear limb darkening parameter it, the 
ratio of semi-major axis to stellar radius a/Rs, the or¬ 
bital inclination i, and the time of mid-transit Tq. The 
light curves have poor coverage of the planet’s ingress, so 
we put priors on a/Rs and i to enable the measurement 
of Tq. We use Gaussian priors with mean and standard 
deviation 2.91 ± 0.02 and 80.56 ± 0.03° (for a/Rg and 
i, res pectively), based on est imates of those parameters 
from Stevenson et al. (2014c I. We use an orbita l period 
P = 1 .0914 24 days and assume a circular orbit (Campo 


et al. 


20111. The data from the two grisms are ht sep¬ 


arately, but for each grism the three transits are fit si¬ 
multaneously. We fit unique values of k and Tq to each 
transit, but tie the values for u, a/Rg, and i over all the 
transits. 

We fit the instrument systematics with a constant nor¬ 
malization term c, a scaling factor S{t), a yisit-long lin¬ 
ear slope V, and an orbit-long exponential ramp with rate 
constant a, amplitude b, and delay D{t). The timescale 
tv corresponds to time relatiye to the expected transit 
midpoint for each visit, and torb is time since the first 
exposure in an orbit (both in BJDtdb)- The scaling fac¬ 
tor S(t) is equal to 1 for exposures with forward spatial 
scanning and s for reverse scanning; this accounts for 
a small offset in normalization between the scan direc¬ 
tions caused by t he upstream-downstream effect of the 
detector readout (McCullough & MacKenty 20121. The 
function D{t) is equal to d tor times t during the Hrst 
fitted orbit and 0 elsewhere. A negative value for d im¬ 
plies that the ramp amplitude is larger in the first orbit 
than in subsequent orbits. The parameters u, a, b, and 
d were constrained to the same value for all the tran¬ 
sits, whereas k, s, c, and v were allowed to vary between 
transits. 

There are a total of 21 free parameters in the fits 
to each grism’s broadband light curve. We estimated 
the parameters and their uncertainties with a Markov 
chain Monte Carlo (MC MC) fit to the data, using th e 
emcee package for Python (Foreman-Mackey et al.|2013). 
For the best-fit light curves, we calculated the Durbin- 
Watson statistic to test for time-correlated noise. The 
values were 1.77 and 2.17 for the G102 and G141 white 
light curves, indicating that the residuals are uncorre¬ 
lated at the 1% significance level. The reduced chi- 
squared values {xD for the best-fit light curves are 1.45 
and 1.34, and the residuals are 164 and 115 ppm (37 and 
25% larger than the predicted photon-l-read noise) for 
the G102 and G141 data, respectively. 

To obtain conservative estimates of the errors for 
the white light curve transit parameters, we redid the 
MCMC analyses with the per-exposure uncertainties 
scaled by a constant factor (1.20 and 1.16) chosen to yield 
Xy = 1. We report the transit times from the white light 
curve fits in Table[2] These values extend t he baseline 
of pryise tra. nsit times by two years from the Stevenson] 
et al. (2014c) measurements and will aid in testing the 


TABLE 2 
Transit Times 


Observation Start 
(UT) 

rp a 

(BJDtdb) 

Uncertainty 

(lo-) 

Jan 01 2014 

6659.07598 

3.4E-4 

Jan 16 2014 

6674.35560 

2.8E-4 

Feb 05 2014 

6694.00161 

2.9E-4 

Feb 15 2014 

6703.82417 

2.9E-4 

Mar 02 2014 

6719.10428 

3.4E-4 

Mar 04 2014 

6721.28692 

3.4E-4 


^ We report the time of central transit Tc in BJDtdb 
- 2,450,000. 


possibility of perihelion preces sion for this system (first 
studied by Campo et al. 2011). The white light curves 
are consistent with the priors on a/Rg and i but do not 
yield improved values for those parameters due to the 
poor phase coverage of the transit ingress. 


3.2. Spectroscopic Light Curves 
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Fig. 4.— An example light curve fit to the 1.320 — 1.389 pm 
spectroscopic channel from the G141 grism. The top panel shows 
the best fitting model light curve (black line), overlaid with the 
syatematics-corrected data (points). Residuals from the light curve 
ht are shown in the middle panel. The bottom panel shows a nor¬ 
malized histogram of the residuals compared to a Gaussian proba¬ 
bility density with mean zero and standard deviation equal to the 
predicted photon+read noise (252 ppm). 


We fit the spectroscopic light curves with a similar 
model as we use for the broadband data. The only dif¬ 
ferences are that we hold a/Rg and i fixed to the prior 
mean values and fix the mid-transit times to the white 
light curve best fit values. We also solve for a constant 
rescaling parameter for the photometric uncertainties in 
each spectroscopic channel to ensure that the reduced 
X^ for the light curve hts is unity. We chose to fix a/Rg, 
i, and Tc because they impact the mean spectroscopic 
transit depth only, not the relative depths. Changes in 
the mean transit depth do not significantly affect our 
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retrieval results, as the planet-to-star radius ratio is in¬ 
cluded as a free parameter in the atmospheric retrieval. 

We fit each of the spectroscopic channel light curves 
independently. We achieve nearly photon-limited pre¬ 
cision: the median rescaling factor for the photomet¬ 
ric uncertainties is 1.1. The light curves do not exhibit 
statistically signihcant time-correlated noise, based on a 
Durbin-Watson test at the 1% significance level. An ex¬ 
ample spectroscopic light curve fit for the 1.320 — 1.389 
/rm channel from the G141 grism is shown in Figure]^ 
We show a pairs plot of the ht parameters for the same 
channel in Figurej^ 

We tested an analytic model for the systematics that 
included a quadratic term for the visit-long trend. Pre¬ 
vious analyses of WFC3 data have suggested that a 
quadratic model p roduces better light curve fits (Steven¬ 
son et al. 2014a). However, for this data set we hnd 
that the quadratic model is disfavored according to 
the Bayesian Information Criterion (BIC), with typical 
A BIC values of 10 compared to the linear model. In any 
case, the main consequence of adding a quadratic term is 
to shift the transmission spectrum up or down. This af¬ 
fects our estimate of the planetary radius, but the effect 
is small relative to the error introduced by uncertainties 
in the stellar radius. 

We also explored modeling the instrument systemat¬ 
ics with the divide-white technique, which h as been 
applied successfully to several other data sets (Steven¬ 
son et al. |2014c| IKreidberg et al.|[2014b[ [Knutson et al. 
2014p. This metnod assumes the systeniatics are inde¬ 
pendent of wavelength. For the WASP-12 data, however, 
this assumption is not appropriate. WFC3 instrument 
syste matics are known t o depend on detec tor illumina¬ 
tion (Berta et al. 2012 Swain et al. 2013), and in our 
data the mean pixel hiience varies by 3U7o between the 
spectroscopic channels (see Figurej^. 

To illustrate the dependence of tne systematics on the 
illumination level, we show in Figure!^ the systematics 
decorrelation parameters from the anmytic model as a 
function of light curve normalization c. The value c rep¬ 
resents the baseline flux level in each spectroscopic light 
curve. This value is directly proportional to the mean de¬ 
tector illumination in the channel. We note several qual¬ 
itative trends in the decorrelation parameters: with in¬ 
creasing illumination, the visit-long slope decreases, the 
delay term increases toward zero, and the ramp rate in¬ 
creases. 

The upshot of these trends is that there are residual 
systematics for the divide-white light curve fits that 
are correlated with detector illumination. We therefore 
only report the results for the analytic model. 

3.2.1. Limb Darkening Models 

We tested fixing the limb darkening to values predicted 
by stellar models. We used theoretical quadratic limb 
darkeni ng coefficients from both PHOENIX and Kurucz 
models (Hau schildt et all|1999 [ Castelli fc Kurucz]|2004 1 


generated for Stevenson et al. '(|2U14c ). i'hese theoretical 


coefficients yielded lower quality light curve fits than we 
obtained from empirically estimating a linear limb dark¬ 
ening parameter. For both PHOENIX and Kurucz model 
coefficients, the light curve residuals exhibited systematic 
trends near ingress and egress, and the typical reduced 
chi-squared values increased. The poor match of theo¬ 


retical limb darkening coefficients to our data may arise 
from inaccurate assumptions about the stellar composi¬ 
tion, or inaccuracies in the models themselves. Similar 
disagreement with model limb darkening has been s een 
for other high quality data (e.g. Knutson et al.|[2007 |. 

Since incorrect limb darkening coefficients can intro¬ 
duce systematic bias in the measured transit depths, we 
chose to estimate the limb darkening coefficients directly 
from the data. Our light curves are not precise enough to 
constrain a two-parameter limb darkening model, so we 
instead fit for a single linear parameter in each channel. 
The data are sufficiently precise to distinguish between 
the transit depth and the limb darkening coefficient, as 
evidenced by the lack of correlation between those pa¬ 
rameters in the pairs plot shown in Figurej^ 

3.3. Correction of Dilution from Stellar Companions 

The transit light curves are affected by dilution from 
WASP-12 A’s two companion stars, WASP-12 BC, and 
from the planet’ s nightside emission. Following Steven-| 
son et al. (2014c), we calculated a corrected transit depth 
6' tor each spectroscopic channel using the formula 


S — ^(f 4“ O^Comp 4“ 


( 2 ) 


where <5 is the measured transit depth, acomp is the ra¬ 
tio of flux from WASP-12 BC to WASP-12 A, and ap is 
the ratio of the planet’s nightside emission to the flux of 
WASP-12 A. We calcula ted ap using the same model as 
Stevenson et al. (2014c). We determined acomp empiri- 
cally using tlie spectra extracted from the staring mode 
observations. Each visit is assigned a unique acomp value 
to account for differences in the orientation of the spectra 
on the detector. We plot the dilution factor as a function 
of wavelength in Figure]^ (calculated by interpolating 
the companion spectrum to the same wavelength scale 
as the WASP-12 A spectrum). The measured dilution is 
rough l y 10% larger than the values used by Stevenson 
et al. (2014c); this difference could result from system¬ 


atic uncertai nty in the aperture phot ometry used to scale 
the dilution ( Stevenson et al.|[2014c ). 

To obtain hnal transit depths, we take the weighted 
average of the corrected transit depths from each visit. 
Uncertainties in the dilution factor corrections are prop¬ 
agated through to the final transit depth uncertainties. 
We report the corrected transit depth measurements in 
Table[^ and we show the transmission spectrum in Fig¬ 
ure]^ The corrected transit depths are consistent be¬ 
tween the visits, indicating that the estimated uncertain¬ 
ties are appropriate, and confirming that stellar activity 
does not significantly influence the measured spectrum. 

4. COMPARISON WITH PREVIOUS RESULTS 

There are several other high-precision transit depth 
measurements for WASP-12b in addition to the WFC3 
spectrum we report here. The most precise of these are 
spectroscopy betw een 0.34 and 0.94 ^m from HST /STIS 
(|Sing et al.)|2013|), spectros copy between 0.73 and 1 .00 
/im from Cemini /CMOS (Stevenson et al.||201 4c|), a 
starin g mod e HST /WFC3 C141 spectrum from |Swam 
et al. (2013), a nd photometry at 3.6 and 4.5 from 
Spitzer/IRAC (|Cowan et ai.||2012|). Figure^ shows 
past optical/near-lK transmission spectrum measure¬ 
ments compared to our results. 
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Fig. 5.— A pairs plot showing distributions of MCMC fit parameters for the 1.320 — 1.389 fim light curve. The off-diagonal panels show 
marginalized posterior probability for pairs of parameters, with 1, 2, and 3cr credible intervals indicated with black contours. The gray 
shading corresponds to probability density (darker for higher probability). The panels on the diagonal show distributions of each parameter 
marginalized over the other model parameters, with the median and 68% credible interval marked with dashed lines. The planet-to-star 
radius ratio, /c, is not strongly correlated with any of the other fit parameters. For parameters that are allowed to vary between transit 
observations (/c, c, s, and u), we show distributions for the 16 Jan 2014 transit. 
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c ( X ) 


TABLE 3 

Transit Fit Parameters 


Bandpass^ 

(Rp/R*)"*’ 

Limb- 

rms 

{pm) 

(%) 

darkening u 

(ppm) 

0.838 - 

0.896 

1,4441 ± 0.0069 

0.27 ± 0.02 

349 

0.896 - 

0.954 

1.4422 ± 0.0055 

0.28 ± 0.01 

300 

0.954 - 

1.012 

1.4402 ± 0.0052 

0.27 ± 0.01 

296 

1.012 - 

1.070 

1.4428 ± 0.0051 

0.26 ± 0.01 

301 

1.070 - 

1.129 

1.4391 ± 0.0053 

0.25 ± 0.01 

299 

1.112 - 

1.182 

1.4386 ± 0.0047 

0.26 ± 0.01 

267 

1.182 - 

1.251 

1.4365 ± 0.0045 

0.26 ± 0.01 

267 

1.251 - 

1.320 

1.4327 ± 0.0041 

0.21 ± 0.01 

253 

1.320 - 

1.389 

1.4582 ± 0.0040 

0.22 ± 0.01 

242 

1.389 - 

1.458 

1.4600 ± 0.0043 

0.18 ± 0.01 

255 

1.458 - 

1.527 

1.4530 ± 0.0045 

0.20 ± 0.01 

270 

1.527 - 

1.597 

1.4475 ± 0.0058 

0.16 ± 0.02 

332 

1.597 - 

1.666 

1.4332 ± 0.0055 

0.16 ± 0.02 

321 


^ The measurements between 0.838 - 1.129 fim are from the 
G102 grism; those from 1.112 - 1.666 fim are from the G141 
grism. 


^ Transit depths corrected for dilution from companion stars 
and planet nightside emission. The light curve fits had a/Rs 
and i fixed to 2.91 and 80.56°, respectively. 


Fig. 6. — Systematics decorrelation parameters from the light 
curve fits plotted as a function of normalization c. We show (from 
top to bottom) the visit-long slope v, ramp rate constant a, ramp 
amplitude b, and ramp delay d. The normalization constant c is 
linearly proportional to the per-pixel flux in a spectroscopic chan¬ 
nel. Measurements from the G102 and G141 data are shown in 
light and dark gray, respectively. The error bars indicate 1 a un¬ 
certainties from an MCMC fit to the light curves using the analytic 
model for instrument systematics. 



Fig. 7. — The dilution factor ocompi determined from the ratio of 
flux from WASP-12 BC to WASP-12 A. The median dilution (over 
the 10 staring mode observations) is shown with black lines (dotted 
for G102 and dashed for G141). The gray shaded region indicates 
1(T uncertainty determined from the median absolute deviation. To 
calculate the dilution and uncertainty, we interpolate the spectra 
from each grism onto a common wavelength scale. 


Our transit depth measurements are consisten t with 


the staring mode spectrum from HST/WFC3 (Swain 

et al.||2013 

Mandell et al. 

2013 

Stevenson et al.||2U14c), 


modulo a constant ottset in the absolute transit depths. 

We chose not to incorporate this data set in our analy¬ 
sis, however. The staring mode data has a low duty cycle 
and increases the amount of in-transit exposure time by 
just 10% over the total from our three spatial scan ob¬ 


servations. Morever, there is uncertainty about the best 
instrument system atics model for this data set (Steven¬ 
son et al. 2014c). Since the marginal improvement m 


measurement precision is counteracted by increased sys¬ 
tematic uncertainty, we chose to focus on the transmis¬ 
sion spectrum derived from the spatial scan data only. 

Our spectrum agrees less well with the gro und-based 
transmission spectrum from Gemini/GMOS (Stevenson 
et al [20140 1. To compare the WFC3 and CMOS results, 
we computed values for the data using the best fit 
mode l spectrum from the FREE retrieval (described in 
§ 5.1) over the wavelength range where the spectra over- 
lap (0.86 - 1.00 ^m). We allowed a free offset in the 
model for the GMOS data to account for differences in 
the absolute measured transit depths. We find that the 
GMOS data are inconsistent with the model at > 4cr 
(x^ = 38.7 for 9 degrees of freedom). Based on the track 
record of precise, reproducible transmission spectr a from 
WFC3 that are well-fit by theoretical models (e.g. Krei- 
dberg et al.||2014b|at |Stevenson et al.||2014d|, we trust 
the reliability of the Wh'G3 measurements over those 
from GMOS. Ground-based transit observations exhibit 
strong time-correlated noise that makes estimating accu¬ 
rate confidence intervals challenging. Additional GMOS 
transit spectroscopy observations would help identify the 
number of repeated observations with this instrument 
needed to make robust measurements. 

The Spitzer transit depths measurements are approxi¬ 
mately 0.1% smaller than the mean WFC3 transit depth. 
Based on the random errors alone, this difference is sig¬ 
nificant. However, the Spitzer data have significant sys¬ 
tematic uncertainties, because the relative transit depths 
measured with IRAC channels 1 and 2 can shift by 
> 0.1% depending on the aperture size used in the data 
reduction. There i s no obvious optimal c hoice of aper¬ 
ture (see Figure 17, Stevenson et al. 2014c). We therefore 
chose not to incorporate the Spitzer results in our anal¬ 
ysis. 


5. RETRIEVAL OF THE ATMOSPHERIC PROPERTIES 
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Fig. 8. — The transmission spectrum of WASP-12b measured with HST /WFC3 (points). The error bars on the transit depths are 1 cr 
uncertainties from an MCMC fit to the light curves. We show the best fit model binned to the resolution of the data (blue squares). The 
shaded regions indicate 1- and 2 a credible intervals in the retrieved spectrum (medium and light blue, respectively), relative to the median 
fit (dark blue line). The secondary y-axis labels indicate the atmospheric scale height. One scale height corresponds to 470 km (for a 
temperature of 1400 K). The increase in transit depth near 1.4pm corresponds to an H 2 O bandhead. 
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4). We tested two ditterent model pa- 
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(k 

rameterizations with CHlMhlKA. The first is the widely 
adopted approach of retrieving molecular abundances 
without any constraints from chemistry (e.g. Madhusud- 

i 





han & Seager 2009). For our second approach, we de- 
yeloped a new parameterization that retrieyes C/0 and 
atmospheric metallicity under the assumption of chemi- 
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Fig. 9. — High-precision optical/near-IR transit depth mea¬ 
surements for WASP-12b. We show measurements from 
i75T/STIS (green diamonds), Gemiraj/GMOS (orange squares), 
and HST /WFC3 (from this work, black circles; from staring mode 
data, pink circles). The best fit model from the CHIMERA FULL 
retrieval is indicated by the gray line. Note that this model was 
fit to the WFC3 data only. There may be small offsets between 
data sets due to different corrections for the companion star dilu¬ 
tion and the challenge of mea suring absolute transit de pths in the 
presence of systematic errors ( [Stevenson et al.|2014c|^ . 

Given the high precision of the WFC3 transmission 
spectrum, we can put more powerful constraints on the 
planet’s atmosphere than has been possible with past 
measurements. No previous data set for WASP-12b has 
shown conclusive spectroscopic evidence for molecular 
absorption, but the WFC3 spectrum has a noticeable 
increase in transit depth near the center of the water 
absorption band at 1.4 pm (see Figure]^. 

To quantify the water abundance and other atmo¬ 
spheric properties based on the transmission spect rum, 
we performed a retr ieval using the CHIMERA suite (Line 
et al. 2013b 2014). Modifications to the retrieval code 


cal equilibrium. We refer to these methods as the FREE 
approach and the CHEMICALLY-CONSISTENT (C-C) 
approach, respectively. Both methods are described in 
more detail below. We also present a com parison with 


results from the NEMESIS retrieval code (Irwin et al. 


20081 . 


We elected to focus on the WFC3 data for our re¬ 
trievals, based on the caveats for other data described in 
§1^ We treated the spectra from the two WFC3 grisms 
as a single data set, with no offset in transit depth be¬ 
tween them. The transit depths are consistent between 
the grisms because the influence of ste llar activity is be¬ 
low the measurement precision (see §2.1), so no offset 
is needed. We also te sted i ncluding STTS data in some 
of the retrievals (see §5.1.2), but found that it does not 
affect the main conclusions of this work. 

5.1. FREE Retrieval with CHIMERA 

The FREE parameterization retrieves molecular abun¬ 
dances, cloud and haze properties, and the altitude- 
independent scale height temperature. 

Our model explored the dominant molecular opacities 
expected for a solar composition gas at the temperatures 
and pressures probed by the observations. These are 
H 2 O, TiO, Na, a nd K over the wavelength range of the 
WFC 3 spectrum (Burrows & Sharp||1999 Fortney et al. 
20081. The remaining gas is assumed to be a solar com- 
position mixture of H 2 and He. We discuss results for 
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including additional molecular species in § |5.1.2| 

In addition to the molecular opacities, we included 
opacity from clouds and haze. Previous analyses of 
WASP-12b’s transmission spectrum have suggested that 
hazes are present in the atmo sphere, but the haze compo- 


2013 


Stevenson 


sition is poo rly constrained (Sing et ah, 
et al.|2014c ). We therefore used a flexible model that in¬ 
cludes a power law haze and an opaque gray cloud deck. 
We modeled clouds as a gray opacity source that masks 
transmission through the atmosphere below a fixed pres¬ 
sure level Pc- The haze opacity was parameterized by 
cr(A) = cro(A/Ao)’*', where the scatterin g amplitude an 


and slope 7 are fr ee parameters (as in Lecavelier Des 
Etangs et al.||200^. The scattering is presumed to hap¬ 


pen throughout the entire atmosphere as if it were a well 
mixed gas. The scattering amplitude is a scaling to the 
H 2 Rayleigh scattering cross section times the solar H 2 
abundance at 0.43 /rm. More sophisticated models for 
clouds and haze would require additional free parameters 
that are not justified by the precision of our data. Our 
simple parameterization is sufficient to capture degen¬ 
eracies between clouds/haze and the water abundance, 
which is the primary goal of this investigation. 

We also retrieved a “scale height temperature” param¬ 
eter Tg. We assumed the atmosphere is isothermal, with 
temperature equal to Tg at all pressures. We tested 
fitting for a more complex temperature-pressure pro¬ 
file but found it did not affect our results because the 
transmission spectrum is only weak ly sensitive to the at¬ 


mosp here’s thermal structure (see Barstow et al.j 2013 


20141. 

The final free parameter in the retrieval was a scale fac¬ 
tor for the planet’s 10 bar radius. We assumed a baseline 
plan et radius of 1 . 79 R^ and a stellar radius of 1.57 Rq 
from 
by a 


Hebb et al. (2009). The planet radius was scaled 
actor Rscaie to account for uncertainty in the pres¬ 
sure level in the atmosphere at a given radius. To hrst 
order, the effect of this scaling is to shift the model tran¬ 
sit depths by a constant factor. A second order effect is 
that scaling the radius chan ges the amplitude of spectral 


featu res (see Equation 1 in Lecavelier Des Etangs et al. 


2008). 

In sum, the retrieval had 8 free parameters: the abun¬ 
dances of H 2 O, Na-I-K (fixed at the solar abundance ra¬ 
tio), and TiO, as well as clouds and haze, the scale height 
temperature, and the planet radius scale factor. We re¬ 
fer to this 8 -parameter fit as the FULL model. The best 
fits for the FULL model and nested models within it are 
shown in Figure[^ The best-fit models are those that 
produced the lowest values in the MCMC. We list 
the retrieved water abundances, temperatures, and best 
fit values for these models in Table® The distribu¬ 
tions of retrieve d p arameters from the FULL model are 
shown in FigurejTO We note that all of the nested mod¬ 
els are nearly indistinguishable near the water absorp¬ 
tion feature at 1.4 /rm. The best constrained molecular 
abundance is that of water: we retrieved a water volume 
mixing ratio (VMR) of 1.6 x 10“"^ — 2.0 x 10“^ at la, 
which is consistent with expectations for a solar compo¬ 
sition gas. The other molecular abundances are not as 
well constrained. We retrieved a 3 cr upper limit on the 
VMR of TiO equal to 2 x 10“^, and the abundance of 
Na-|-K is unbounded. The cloud and haze properties are 
also poorly constrained. We discuss the implications of 




Fig. 10. — Best fits for different assumed models (lines) compared 
to the WFC3 transmission spectrum (points). Models are binned 
to a wavelength resolution of 0.01 /im. Results from the FREE and 
C-C parameterizations are shown in the top and bottom panels, 
respectively. 

these measurements for the atmosphere composition in 
detail in §[^ 


5.1.1. Detection Significances 

We performed nested model selection to identify how 
strongly each opacity source is detected in the spectrum. 
The standard Bayesian approach for comparing models is 
to use Bayes f actors, which can be converted to detection 
significances dSellke et al. ||2001[ |Trotta||2008|). See also 
Benneke & Seager ( |2013| ^ an application of nested 


Bayesian model comparisons as applied to super-Earth 
atmospheres. Bayes factors are the ratio of the Bayesian 
evidence (marginal likelihood) of the two models under 
consideration. The evidence is a multidimensional inte¬ 
gral over the entire posterior volume, a non-trivial cal¬ 
culation, so here we use two methods to approximate 
the integral. The fir st is the Numeri cal Lebesg ue Al¬ 
gorithm de scribed in Weinberg (2012) (see also Swain 
let al.||20T4l for an application to exoplanet atmospheres 
and a comparison to t he Ay^ test). The second is the 
Laplace approxim ation (Kass & Raftery|1995 Cornish & 
Littenberg|2007 ). Most methods for coniputing Bayesian 


evidence diverge for low detection significances, but they 
agre e well for highly significant detections (see Fig. 3 
from Cornish & Littenberg||2007 ). 

Using these techniques, W determined the detection 
signihcances for the following parameter combinations: 
H 2 O, absorbers other than H 2 O, clouds and haze com¬ 
bined, clouds only, haze only, and all opacity sources 
other than H 2 O. We computed the detection significance 
for each nested model by calculating its Bayes factor rel¬ 
ative to the FULL model. The results are shown in Ta¬ 
ble]^ If a parameter has a low detection significance 
value (^ 3 cr), that parameter does not provide a statis¬ 
tically significant improvement to the model fit. 

We hnd that H 2 O is detected at high confidence in the 
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Fig. 11.— A pairs plot of the distribution of parameters retrieved with CHIMERA for the FULL model. The parameters are the scale 
height temperature Ts (in Kelvin), the planet radius scale factor Rgcaiej the cloud-top pressure Pc (in log bars), the logarithm (base 
10) of the molecular abundances, and the haze scattering amplitude cro and slope 7 . The off-diagonal plots show marginalized posterior 
probability density for pairs of parameters (darker shading corresponds to higher probability). The diagonal plots show marginalized 
posterior probability distributions for individual parameters, with the median and 68 % credible interval marked with dashed lines. 


spectrum {7 a). We also find that the presence of all 
other opacity sources combined (clouds, haze, TiO, Na 
and K) is significant at > 3 tr. However, no other com¬ 
bination of these opacities is significantly detected (e.g., 
the detection significance for clouds alone is < 2 cr). The 
reason for this is that the parameters are degenerate. 
For example, both clouds and haze both have the effect 
of truncating the height of the water feature. Similiarly, 
haze, the wings of the alkali metal lines, and presence 
of TiO can all contribute to the rise in transit depths 


towards the blue end of the spectrum. Therefore, if a 
few of these opacity sources are removed, the others can 
compensate. However, removing all opacities besides wa¬ 
ter results in a sig nificantly poorer model fit (as can be 
seen in FigureflM. We therefore focus on results from 
the FULL modelbecause it accounts for the likely pres¬ 
ence of some combination of these other species and the 
degeneracies they may have with H 2 O. 

5.1.2. Retrievals with Additional Molecules/Data 
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TABLE 4 

FREE Retrieval Results 


Scenario^ 

Water abundance^ 
(VMR) 

Temperature^ 

(Kelvin) 

^2c 

Weinberg 
significance*^ (cr) 

Laplace 
significance ((t) 

FULL {Ts, Rscaie, Pc, H 2 O, TiO, Na-I-K, cro, 7) 

1.5 X IQ-'* - 2.2 X 10-2 

1040 - 1870 

2.82 

_ 

_ 

No water [Ts, Rscaie, Pc, TiO, Na-I-K, ao, 7 ) 

- 

130- 1560 

47.0 

6.9 

7.1 

No cloud, haze, other absorbers H 9 O) 

1.1 X 10“® - 1.5 X 10-2 

730- 1170 

11.1 

3.2 

4.6 

No other absorbers Pr, H^O, crn, ■y) 

6.9 X 10-® - 2.9 X 10-2 

1090 - 1890 

3.59 

1.2 

2.6 

No haze (T^, Rscaie, Pc, H 2 O, TiO, Na-I-K) 

5.8 X 10-5 - 8.3 X 10-3 

910-1470 

2.87 

1.6 

2.5 

No cloud (Ts, Rscaie, H 2 O, TiO, Na-I-K, cro, 7 ) 

6.6 X 10-5 - 2.0 X lO-'* 

1000 - 1840 

2.86 

- 

2.0 

No cloud/haze (Ts, fiscaie; H 2 O, TiO, Na+K) 

2.4 X 10-5 - 5.5 X 10-3 

860 - 1430 

2.95 

- 

1.2 


^ Indicates the parameters that are removed from the FULL model. The remaining parameters are shown in parentheses. 

^ The range of values corresponds to the 68% credible interval centered on the median retrieved value. 

^ values are calculated for the best fit model. 

Scenarios where the detection significance is undefined (because the Bayes factor is less than 1) are marked by —. The detection significance for 
the FULL model is also undefined because significances are defined relative to it. 


We explored many different combinations of models 
and data sets before we arrived at the analysis presented 
above. One option we considered was to fit a WFC3 spec¬ 
trum with smaller wavelength bins. We created a “high- 
resolution” spectrum with R = 55 — 70, in contrast to 
the “low-resolution” data (i? = 15 — 25) we used for our 
final modeling. The spectra are qualitatively similar, but 
the models fit the low-resolution data better {xt = 0.6 
versus 1.5). The high-resolution spectrum has larger ran¬ 
dom scatter around the best fit model, but no systematic 
trends indicating that the model is missing a particular 
physical effect. The scatter could be due to undiagnosed 
systematics at the pixel level in the data, which is reme¬ 
died by increasing the number of pixels that contribute 
to each spectral channel. 

The retrieved water abundances for both resolutions 
are very similar (1.5 x 10“"*^ — 2.2 x 10“^ and 1.2 x 10“"* — 
1.8 X 10“^ for low- and high-resolution). On the other 
hand, the best fit scale height temperatures differ by 1.4 cr 
( 13701 * 134 ° versus 2020+14° for low/high). This difference 
is due to slight changes in the morphology of the water 
feature between the data sets. It illustrates how retrieval 
results can be biased or have underestimated uncertain¬ 
ties when the model does not provide a good fit to the 
data, either because of missing model physics or under¬ 
estimated data error bars. 

Another test we performed was to fit the combined 
transmission spectrum from HST /STIS and WFC3. The 
STIS spectrum covers optical wavelengths and exhibits 
a linear increase in transit depth from red to blue (see 
Figure]^. Incorporating the STIS data in the retrieval 
increased the detection significance of clouds and haze 
to > 3 cr (because of the rise in transit depth towards 
the blue), but otherwise did not significantly change the 
results. 

We also tested including additional absorbing species 
in the retrieval. Previous work has suggested WAS P-12b 
has a carbon-rich atmosphere composition (e.g. |Mad- 


Sharp 1999 Fortney et al. 2008 
h'or this retrieval, we also ht the S 


Madhusudhan 2012). 
.IS data and used the 


husudhan et al. 2011 b), so we wanted to confirm that 


our estimate of the water abundance is not biased by 
only including absorbers expected for solar composi¬ 
tion. We therefore ran a retrieval including all the 
major opacity sources expected for either oxygen-rich 
or carbon-rich compositions, including H 2 O, CO, CO 2 , 
NH 3 , TiO, VO, Na, K, CH 4 , C 2 H 2 , HCN, H 2 S' FeH, N 2 , 


higher-resolution WFC3 spectrum. The retrieved water 
abundance of 2.5 x 10“^ — 2.0 x 10“^ is nearly identical 
to the results we obtained from the FULL model. No 
individual absorbers were significantly detected besides 
H2O. 

Our conclusion from all the scenarios we fit is that the 
constraints on the water abundance are not significantly 
impacted by the choice of data sets or the absorbers in¬ 
cluded in the modeling. 

5.2. CHEMICALLY-CONSISTENT Retrieval with 
CHIMERA 

In addition to ret rieva ls with the FREE parameteriza¬ 
tion (described in §5.1), we also developed a reparam¬ 
eterized model to retrieve atmospheric properties that 
are consistent with chemical equilibrium. Rather than 
varying the mixing ratios of individual gases, we made 
C/0 and metallicity free parameters and computed ther¬ 
mochemical equilibrium abundances along the temper¬ 
ature/pressure (T/P) profile for major species. We re¬ 
trieved the T/P profile rather than a scale height temper¬ 
ature in order to explore any additional degeneracies due 
to the profile shape. The T/P profile is modeled with a 
5-parameter analytic function (|Line et al. 2013b I which 
produces physically realistic th ermal profiles consistent 
with radiative equilibrium (e.g. |Guillot 2010 Robinson 
fc Catlin^|2012[ |Heng et al.| 2012(~ Parmentier fc Ouil- 
lot||2014p. We calculated chemical prohles for H 2 , He, 
T^CJ-Ck^, CO, CO 2 , NH 3 , H 2 S, PH 3 , C 2 H 2 , HCN, 
Na, K, FeH, TiO, VO, and N 2 to cover the full range 
of gases that could contribute significant opacity in the 
near-IR. The equilibrium abundances are computed us- 


code 

McBride & Gordon 

1996 Moses et al. 20111 

et al. 

2UIi 

). The chemica 

prohles were ted into tn 


spectra for comparison with our data. The model also 
included opacity from cl ouds and haze (using the same 


and collisionally-induced H 2 /He absorption (Burrows & 


formalism described in §5.1), and the reference radius 
Rp, for a total of 11 free parameters. While we re¬ 
duced the number of free absorber parameters to just 
C/0 and metallicity, the overall number of free parame¬ 
ters increased due to the more flexible T/P profile. 

We ran retrievals using three different priors for C/0. 
The fiducial case had an uninformative prior constraint 
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TABLE 5 

C-C Retrieval Results 


Scenario 

C/O 

Metallicity 
(x solar) 

Water abundance 
(Icr) 

Temperature^ (K) 
(Icr) 


Fiducial 

0 -rich 

C-rich 

0.2-0.7 
0.1-0.7 
2.9-51 

0.3 - 20 

0.3 - 30 

0.2 - 80 

9.3 X 10-5 - 5.3 X 10-5 
1.2 X 10-"^ - 1.5 X 10-2 
1.5 X 10-5 - 8.6 X 10-5 

1090 - 1760 

1090 - 1890 

120 - 550 

2.43 

2.75 

29.2 


^ The temperature range corresponds to a pressure level of 1 mbar. 



Fig. 12.— Retrieval results from the C-C parameterization. The left and middle panels show the marginalized distribution for C/O and 
metallicity. The right panel shows the median (black lines) and 1 a range (shaded regions) of 1000 randomly sampled temperature-pressure 
profiles for each scenario. Yellow, blue, and red shading correspond to constraints from the fiducial scenario (an uninformative prior on 
C/O), the oxygen-rich scenario (C/O < 1), and a carbon-rich scenario (C/O > 1). The distribution of C/O values for the carbon-rich model 
is normalized to have a probability mass of unity over the plotted range; however, the distribution has an extended tail toward higher C/O 
values that is not shown. 


on C/O. We also ran retrievals that constrained the 
atmospheric composition to be either oxygen-rich or 
carbon-rich. For the 0-rich scenario, the prior proba¬ 
bility was set to zero for C/O values greater than unity. 
Correspondingly, the C-rich scenario had zero prior prob¬ 
ability for C/O < 1. We show the best fit models for all 
three cases in Figure[^ Figurep^ shows marginalized 
distributions of C/O and metallicity for each scenario, 
as well as constraints on the temperature-pressure pro¬ 
files. Tablej^ gives the values of the best fits in each 
scenario. It also lists the 68% credible intervals for C/O, 
metallicity, and the temperature and water abundance 
at 1 mbar pressure. 

The fiducial and oxygen-rich scenarios give nearly iden¬ 
tical constraints on C/O and the atmospheric metallic¬ 
ity. Even though the fiducial model has an uninforma¬ 
tive prior on C/O, 100% of the retrieved C/O values 
are less than unity for this case (suggesting the result 
is data-driven rather than prior-driven). Both scenar¬ 
ios yield constraints on the molecular abundances and 
thermal profile that agree well with results from the 
FULL model. The median retrieved water abundances 
are 6.3 x 10“^ and 1.0 x 10“^ for the fiducial and 0-rich 
scenarios, respectively. The median 1 mbar temperatures 
are 1410 and 1450 K. We retrieved metallicities in the 
range 0.3 — 30x solar (at 1 a). The retrieved cloud and 
haze properties for these scenarios are consistent with 
the FULL model at 1 a. 

By contrast, the carbon-rich scenario produced signif¬ 


icantly different atmospheric properties. To reproduce 
the water absorption feature in the spectrum, the tem¬ 
perature was driven to much lower values (the median 
is 320 K at 1 mbar). These low temperatures allow for 
higher water abundances (see Figurefl^, though they 
are unlikely for the terminator region given how highly 
irradiated the planet is. The median water abundance, 
1.4 X 10“"*, is comparable to that for the 0-rich and fidu¬ 
cial scenarios. On the other hand, the median methane 
abundance increases to 2.4 x 10“^, versus 1.3 x 10“^^ 
for the fiducial case. These differences have two main 
effects on the model transmission spectra. One is that 
absorption features have smaller amplitude because lower 
temperatures decrease the atmospheric scale height. The 
second effect is that methane absorption is present in the 
spectrum. It is especially noticeable in the window be¬ 
tween water features at > 1.6 um wavelengths (see the 
best-fit C-rich model in Figure 10). The consequence of 
these changes is that C-rich models do not fit the mea¬ 
sured spectrum as well as 0-rich models. We discuss the 
strength of the evidence for one scenario over the other 
in quantitative detail in §1^ 


As a test, we also con^ered a scenario with clouds 
and haze removed. We computed the Bayes factor for 
this nested model and found a detection significance of 
3.7 tr for the cloud and haze parameters. The detection 
significance is higher than for the FREE model param¬ 
eterization because the assumption of chemical equilib¬ 
rium breaks the degeneracy between the cloud/haze and 
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the other molecular abundances (Na+K, TiO); i.e. the 
other molecular abundances can’t increase arbitrarily to 
make up for the absence of clouds/haze. 

5.3. Retrieval with NEMESIS 

We also compared the results from CHIMERA with 
output from an independent retrievel code to test the 
robustness of our measurements against a different mod¬ 
eling approach (optimal estimation versus MCMC). We 
fit the high-resolution WFC3 spectrum with the NEME¬ 
SIS code (jlrwin et al.||2008|). The model included H 2 O 
as the only molecular opacity source. We modeled the 
atmosphere’s thermal structure as a scalar multiple of 
the dayside te mperature-pressure profile from [Stevenson 
et al. (2014b). NEMESIS does not currently incorpo¬ 
rate cloud-top pressure as a free parameter, so we fixed 
the altitude of an opaque gray cloud deck over a grid of 
pressures ranging from 10 to 10“^ mbar and retrieved 
the atmospheric properties for each case (for further dis- 
cus sion of retrieving clou d properties with NEMESIS, 
see Barstow et al. 2013). The best fit model had a 
cloud deck at 1 mbar and a retrieved water abundance 
of 4.0 X 10“"’’ — 1.7 X 10“'^ at Icr. Models with higher 
altitude cloud decks (0.1 - 0.001 mbar) achieved nearly 
as good a fit, and had 1 a upper bounds on the water 
abundance of 3 x 10“^. These results are in excellent 
agreement with the results from CHIMERA. 

6 . IMPLICATIONS FOR ATMOSPHERIC C/O 

In this section, we quantify the strength of the evidence 
for oxygen-rich compositions (C/O < 1) over carbon-rich 
compositions (C/0> 1) based on our retrieval results 
from two separate modeling approaches. We focus Hrst 
on constraints from the water abundance from the FREE 
retrieval, since water is the only molecule that is unam¬ 
biguously detected in the spectrum. We then discuss 
constraints from the C-C model, which retrieves C/O 
directly under the assumption of chemical equilibrium. 

6 .1. Constraints from the FREE Retrieval 

Broadly speaking, a carbon-rich atmosphere is ex¬ 
pected to have lower water abundance than an oxygen- 
rich atmosphere, because most of the oxygen atoms are 
boun d in CO in chemical equilib rium at high tempera¬ 
tures (jMadhusudhan et al.|2011a|) . Our water abundance 
measurement is a qualitatively better match to predic¬ 
tions for an oxygen-rich composition. In Figure p^ we 
show the retrieved H 2 O abundance for the FULL re¬ 
trieval and nested models in comparison with thermo¬ 
chemical equilibrium predictions for oxygen-rich (C/O = 
0.55 = solar) and carbon-rich (C/O = 1) atmospheres. 
The predicted abundance depends on pressure and tem¬ 
perature, so we plot the span of predictions over pressures 
from 0.1 to 10 mbar as a function of temperature. 

To quantitatively compare our measurement to the 
models, we must marginalize over the uncertainty in the 
pressure and temperature probed by the observations. 
To do this, we compute an equivalent pressure level Peq 
corresponding to an optical depth r = 0.56 at 1.4 /im at 
each step in the MCMC chain. This quantity is repre¬ 
sentative of the typical pressure level at which photons 
are absorbe d by water molecules (Lecavelier Des Etangs 
et al. 2008). Figure 14 shows the distribution ol equiv¬ 
alent pressures from the MCMC chain. Note that peg 


is not a retrieved quantity, but rather derived from the 
opacities at each MCMC step. We show the distribu¬ 
tion of e quiv alent pressures obtained from this method 
in Figure[T4| The pressures have a 1 cr range of 0.05 to 5 
mbar, with a peak at 0.5 mbar. 

For a specified atmospheric composition, we can then 
calculate the predicted equilibrium H 2 O abundance for a 
temperature and pressure equal to Tg and Peq- This pro¬ 
cedure yields a “calculated” water abundance to compare 
with the “observed” water abundance at each step in the 
MCMC chain. We show the distribution of observed mi¬ 
nus calculated (“O - C”) values for the oxygen-rich and 
carbon-rich models in Figured^ (right panel). 

To test how well each modelagrees with the retrieved 
H 2 O abundance, we assess whether the 0-C distribution 
is consistent with zero. For the oxygen-rich model, zero 
is contained in the 1 cr credible interval centered on the 
median, indicating that this model is a good match. By 
contrast, the 0-C distribution for the carbon-rich model 
is in tension with zero at approximately 2cr (the 95% 
credible interval is 0.1 — 6.4). The median 0-C value is 
3.7, implying that the typical retrieved water abundances 
are nearly four orders of magnitude larger than predicted 
for a carbon-rich composition. 

We emphasize that these results are strongly depen¬ 
dent on the temperature and pressure probed by the ob¬ 
servations. Assuming a different local thermal profile 
can result in order-of-magnitud e differences in the p re- 
dicted abundances (cf. Figure 2, |Madhusudhan|2012 ). It 
is therefore essential to account for the temperature and 
pressure (and their uncertainties) when estimating C/O. 

6 .2. Constraints from the C-C Retrieval 

We obtain more stringent constraints on the atmo¬ 
spheric C/O from the C-C retrieval than from the FREE 
parameterization. By every metric we enumerate below, 
C/O values greater than one are ruled out at high conh- 
dence. First, the entire posterior probability distribution 
for C/O from the fiducial model is less than one. We cal¬ 
culate that there are roughly 3000 independent samples 
in the MCMC chain for that model, which is a sufficient 
number to rule out C/O > 1 at greater than 3 a con¬ 
fidence. Second, the values for the best ht models 
provide additional evidence in favor of the 0-rich sce¬ 
nario. The best fit 0-rich model has x^ = 2.75 versus 
x’^ = 29.2 for the best fit C-rich case (for 2 degrees of 
freedom). This difference in x^ implies the 0-rich model 
is 10® times more likely than the best fit C-rich model, as¬ 
suming Gaussian statistics. Third, the Bayes factor (the 
ratio of integrated posterior probabilities) for the C-rich 
scenario to the 0-rich scenario is 14, which constitutes 


stron g evidence in favor of the 0-rich model (Jeffreys 
1998|). In addition, the fact that the temperature range 
drops unphysically low - below the condensation temper¬ 
ature of water - is further evidence that the carbon-rich 
model is not appropriate for these data. 

Taken together, these results rule out a carbon-rich 
composition for the atmosphere at high confidence. This 
is a more dehnitive constraint than what we obtained 
from studying the water abundance alone because the 
model is sensitive to both the presence of water and the 
absence of absorption features from other molecules. For 
example, the fact that no methane features are detected 
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Fig. 13.— Left: Measurement of the water abundance and scale height temperature for WASP-12b (points) compared to equilibrium 
chemistry predictions of the water content for different atmospheric compositions (lines and shading). The black point indicates the water 
abundance and temperature measurements from the FULL model from the CHIMERA fit to the WFC3 spectrum. Results from other nested 
models are shown in gray. The black dashed line and blue shading correspond to water abundance predictions for a solar C/O composition, 
and the black dotted line and red shading correspond to C/O = 1. Both models have solar metallicity. For each model composition, 
the shading shows the span of predicted water abundances over pressures ranging from 0.1-10 mbar. The black lines correspond to 1 
mbar, which is the typical pressure level probed by our observations. Right: Histogram of observed minus calculated water abundances 
for the FULL retrieval results relative to different model compositions. The red histogram (dotted line) shows the comparison with a 
carbon-rich model, and the blue histogram (dashed line) shows results for the oxygen-rich model. The gray vertical line marks zero, where 
the observations are an exact match to the model predictions. 



Fig. 14.— Histogram of equivalent pressures corresponding to an 
optical depth of 0.56 at 1.4 fim for each step in the MCMC chain 
for the FULL retrieval. The dotted lines indicate the median and 
surrounding 68% confidence interval. 


ity rather than molecular abundances. Our conclusions 
about the nature of the planet are summarized as follows: 

1. Water is present in the atmosphere. Models that 
do not include water absorption are excluded at 
7 a confidence. This result is the first unambiguous 
spectroscopic detection of a molecule in the planet’s 
atmosphere. 

2. The 68 % credible interval for the retrieved water 
abundance is 10 “® — 10 “^ for a wide range of mod¬ 
els. This result applies to scenarios where water 
is the only absorber included in the model, where 
clouds and haze are added, where additional ab¬ 
sorbers are incorporated (CO, CO 2 , NH 3 , TiO, 
VO, Na, K, CH 4 , C 2 H 2 , HCN, H 2 S, FeH, and 
N 2 ), and where optical transit depth measurements 
from HST /STIS are also fit. This range agrees well 
with the predicted equilibrium water abundance for 
a solar composition atmosphere but is still consis¬ 
tent to within 2a with the prediction for a carbon- 
rich composition. 


in the spectrum strengthens the case for an 0 -rich com¬ 
position beyond the constraints from the presence of wa¬ 
ter alone. The C-C model also assumes more prior knowl¬ 
edge by imposing chemical equilibrium in the calculation 
of the model spectra. 


7. SUMMARY & CONCLUSIONS 

We have measured a precise transmission spectrum for 
the hot Jupiter WASP-12b over the wavelength range 
0.84 to 1.67/rm with HST/WFC3. The transmission 
spectrum is a factor of three more precise than previ¬ 


ous m easurements in this wavelength range (Swain et al. 


20131. We retrieved the atmospheric properties based on 


this spectrum with a variety of models, including a new 
retrieval parameterization that fits for C/O and metallic- 


3. Stellar photons are absorbed at temperatures of 
1000 — 1900 Kelvin and pressures of 0.1 — 10 mbar 
(based on 1 ct ranges for Tg and Peq from the FULL 
model). These estimates have large uncertainties 
because the transmission spectrum is only weakly 
sensitive to the thermal structure of the atmo¬ 
sphere. 

4. Based on a new retrieval parameterization that 
fits for C/O and metallicity under the assumption 
of chemical equilibrium, we constrain the C/O to 
O.hlg'g at 1 cr and rule out a carbon-rich atmo¬ 
sphere composition (C/O > 1) at > 3 cr confidence. 
With this model, we also detect the presence of 
clouds/haze at 3.7 tr confidence. 
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Our constraint on C/0 is in tension with past studies 
of the planet’s atmosphere that found a carbon-rich com¬ 
position was the be st explanation for the dayside emis¬ 


sion spectrum (e .K- Madhusudhan et al. 2011b Steven- 
son et al.||2014b|). We would hope tor better agreement 


between these analyses, as both results are based on 
very high precision data fit with state-of-the-art retrieval 
models. This methodology has yielded consistent results 
for other planets, notably the hot Jupiter WASP-43b, 
which shows excellent agreement between estimates of 
the composition from the dayside emis sion and trans¬ 
mission spectra (Kreidberg et al.||2014a). 


son et al. 2014d Kataria et al. 2014, 20151, so light curves 
in absorption bands (which probe lower pressures) are ex¬ 
pected to have larger amplitudes than those in spectral 
windows. Measuring the wavelength dependence of the 
phase curve amplitude could therefore provide a n addi- 
tional constraint on the atmospheric comp osition ([Show¬ 
man et al. 1120091 |Stevenson et al.||2014d| |Kataria et al. 
2014[ |2015|). 


We note that a caveat for our results from the C-C 
model is our ass umption that the a tmosphere is in chemi¬ 


cal equilibrium. Moses et al. (2013) considered the effects 
of photochemistry and mixing tor WASP-12b and found 
that the water abundance is either unchanged from equi¬ 
librium values (for solar composition), or pushed even 
lower (for a C/0 = 1 composition). Based on these re¬ 
sults, it is unlikely that our high observed water abun¬ 
dance is due to disequilibrium chemistry in a carbon-rich 


Pmaliy, our results highlight the necessity of ob¬ 
taining high-precision data with multiple observing 
techniques (transmission spectroscopy, dayside emission 
spectroscopy, and phase curves) in order to obtain ro¬ 
bust constraints on the rich chemistry and physics of exo¬ 
planet atmospheres. Studying the atmosphere from more 
than one angle (literally) is key to providing a detailed 
understanding of its thermal structure and dynamics, 
which is needed to unambiguously determine the compo¬ 
sition (and vice versa). In addition, high-precision mea¬ 
surements are essential for revealing the small features 
present in exoplanet spectra. The amplitude of these 
features has often been smaller than predicted for tran- 


atmosphere. However, these calculations were for the 

siting planets, due to the presence of clouds or haze and 

planet’s dayside and it would be worth exploring dise- 

shallower thermal prohles (e.g. 

Fortney et al. 2006 Char- 

quilibrium effects for the terminator region specifically 

bonneau et al. 2002 

Deming et al.|2013p. Uur WASP-12b 


Another assumption in our model is that the temper¬ 
ature structure is well approximated as ID. In reality, 
a photon’s slant path through the atmosphere traverses 
many different temperatures and pressures. However, 
theoretical models predict that most of the stellar radi¬ 
ation is attenuated in a fairly localized region wi thin a 
few degrees of the terminator ( Fortney et al.|[Ml0 ). The 


change in temperature and pressure over a region this size 
is small compared to the uncertainty in our estimates of 
the thermal profile, so our assumption of a ID profile is 
unlikely to bias our assessment of the atmospheric com¬ 
position. On the other hand, the observations integrate 
over the entire limb of the planet, and there could be 
large variations in temperature over this region due to 
atmospheric dynamics. It is therefore possible that fit¬ 
ting the limb-averaged spectrum with a single ID ther¬ 
mal profile could bias the results. 

One route to reconciling the disagreement between 
constraints on C/0 from the transmission and emis¬ 
sion spectra is to determine the planet’s global ther¬ 
mal structure. This could be achieved by combining 
phase-resolved emission spectroscopy of the planet with 
3D atmospheric circulation modeling, as has been done 


for W ASP-43b (Stevenson et al. 2014d Kataria et al. 
20151. This cornbination would provide an independent 


measure of the temperature structure at the termina¬ 
tor to strengthen our interpretation of the transmission 
spectrum. It would also put the dayside temperature- 
pressure profile in context of the planet’s global heat cir¬ 
culation and aid in understanding the dayside emission 
spectrum. In addition, a spectroscopic phase curve would 
allow us to estimate the water abundance for new regions 
of the planet’s atmosphere. Furthermore, the spectro¬ 
scopic phase curve amplitudes themselves could provide 
an additional diagnostic of the atmosphere composition. 
Day-night temperature d ifferences for hot Jup i ters are 
larger at lower pressures (Showman et al. 2009 Steven- 


surements, with features crossing just two scale heights. 
We therefore advocate applying an intensive approach 
of high-precision spectroscopy from multiple angles to 
a larger sample of transiting exoplanets to shed light 
on their nature and origins. Such measurements will 
help develop the observing strategies needed for definitive 
characterization of exoplanet atmospheres, and prepare 
the community to make robust measurements of poten¬ 
tially habitable worlds with future facilities. 
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